NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Accelerating optimization over the space of probability measures

Chen, Shi; Li, Qin; Tse, Oliver; Wright, Stephen J (February 2025, Journal of machine learning research)

Free, publicly-accessible full text available February 1, 2026
Accelerating optimization over the space of probability measures

Chen, Shi; Li, Qin; Tse, Olver; Wright, Stephen J (February 2025, Journal of machine learning research)

Free, publicly-accessible full text available February 1, 2026
Optimal Rates for Robust Stochastic Convex Optimization

https://doi.org/10.4230/LIPIcs.FORC.2025.9

Gao, Changyu; Lowy, Andrew; Zhou, Xingyu; Wright, Stephen J (January 2025, Schloss Dagstuhl – Leibniz-Zentrum für Informatik)
Bun, Mark (Ed.)
Machine learning algorithms in high-dimensional settings are highly susceptible to the influence of even a small fraction of structured outliers, making robust optimization techniques essential. In particular, within the ε-contamination model, where an adversary can inspect and replace up to an ε-fraction of the samples, a fundamental open problem is determining the optimal rates for robust stochastic convex optimization (SCO) under such contamination. We develop novel algorithms that achieve minimax-optimal excess risk (up to logarithmic factors) under the ε-contamination model. Our approach improves over existing algorithms, which are not only suboptimal but also require stringent assumptions, including Lipschitz continuity and smoothness of individual sample functions. By contrast, our optimal algorithms do not require these stringent assumptions, assuming only population-level smoothness of the loss. Moreover, our algorithms can be adapted to handle the case in which the covariance parameter is unknown, and can be extended to nonsmooth population risks via convolutional smoothing. We complement our algorithmic developments with a tight information-theoretic lower bound for robust SCO.
more » « less
Full Text Available
Complexity of a projected Newton-CG method for optimization with bounds

https://doi.org/10.1007/s10107-023-02000-z

Xie, Yue; Wright, Stephen J. (July 2023, Mathematical Programming)

Full Text Available
A Line-Search Descent Algorithm for Strict Saddle Functions with Complexity Guarantees

O'Neill, Michael; Wright, Stephen J. (January 2023, Journal of machine learning research)

Full Text Available
Manifold Learning and Nonlinear Homogenization

https://doi.org/10.1137/20M1377771

Chen, Shi; Li, Qin; Lu, Jianfeng; Wright, Stephen J. (September 2022, Multiscale Modeling & Simulation)

Full Text Available
Inexact Newton-CG algorithms with complexity guarantees

https://doi.org/10.1093/imanum/drac043

Yao, Zhewei; Xu, Peng; Roosta, Fred; Wright, Stephen J; Mahoney, Michael W (August 2022, IMA Journal of Numerical Analysis)

Abstract We consider variants of a recently developed Newton-CG algorithm for nonconvex problems (Royer, C. W. & Wright, S. J. (2018) Complexity analysis of second-order line-search algorithms for smooth nonconvex optimization. SIAM J. Optim., 28, 1448–1477) in which inexact estimates of the gradient and the Hessian information are used for various steps. Under certain conditions on the inexactness measures, we derive iteration complexity bounds for achieving $$\epsilon $$-approximate second-order optimality that match best-known lower bounds. Our inexactness condition on the gradient is adaptive, allowing for crude accuracy in regions with large gradients. We describe two variants of our approach, one in which the step size along the computed search direction is chosen adaptively, and another in which the step size is pre-defined. To obtain second-order optimality, our algorithms will make use of a negative curvature direction on some steps. These directions can be obtained, with high probability, using the randomized Lanczos algorithm. In this sense, all of our results hold with high probability over the run of the algorithm. We evaluate the performance of our proposed algorithms empirically on several machine learning models. Our approach is a first attempt to introduce inexact Hessian and/or gradient information into the Newton-CG algorithm of Royer & Wright (2018, Complexity analysis of second-order line-search algorithms for smooth nonconvex optimization. SIAM J. Optim., 28, 1448–1477).
more » « less
Full Text Available
Adversarial classification via distributional robustness with Wasserstein ambiguity

https://doi.org/10.1007/s10107-022-01796-6

Ho-Nguyen, Nam; Wright, Stephen J. (April 2022, Mathematical Programming)

Abstract We study a model for adversarial classification based on distributionally robust chance constraints. We show that under Wasserstein ambiguity, the model aims to minimize the conditional value-at-risk of the distance to misclassification, and we explore links to adversarial classification models proposed earlier and to maximum-margin classifiers. We also provide a reformulation of the distributionally robust model for linear classification, and show it is equivalent to minimizing a regularized ramp loss objective. Numerical experiments show that, despite the nonconvexity of this formulation, standard descent methods appear to converge to the global minimizer for this problem. Inspired by this observation, we show that, for a certain class of distributions, the only stationary point of the regularized ramp loss minimization problem is the global minimizer.
more » « less
Low-Rank Approximation for Multiscale PDEs

https://doi.org/10.1090/noti2488

Chen, Ke; Chen, Shi; Li, Qin; Lu, Jianfeng; Wright, Stephen J (June 2022, Notices of the American Mathematical Society)

Full Text Available
BOME! Bilevel Optimization Made Easy: A Simple First-Order Approach

Liu, Bo; Ye, Mao; Wright, Stephen J.; Stone, Peter; Liu, Qiang (January 2022, Advances in neural information processing systems)

Full Text Available

« Prev Next »

Search for: All records